Lifu Huang

UC Davis

StagePilot: A Deep Reinforcement Learning Agent for Stage-Controlled Cybergrooming Simulation

Feb 04, 2026
Via arXiv

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking

Feb 02, 2026
Via arXiv

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching

Jan 27, 2026
Via arXiv

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Jan 22, 2026
Via arXiv

Zero-shot adaptable task planning for autonomous construction robots: a comparative study of lightweight single and multi-AI agent systems

Jan 20, 2026
Via arXiv

Navigating Ideation Space: Decomposed Conceptual Representations for Positioning Scientific Ideas

Jan 13, 2026
Via arXiv

MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing

Jan 08, 2026
Via arXiv

How Do Large Language Models Learn Concepts During Continual Pre-Training?

Jan 07, 2026
Via arXiv

SuperFlow: Training Flow Matching Models with RL on the Fly

Dec 17, 2025
Via arXiv

Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures

Nov 17, 2025
Via arXiv